BIOTECHNOLOGY ra-O- 



RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 



Application Serial Number 
Source: 

Date Processed by STIC: 




THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: patin21help(S>uspto.gov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3help(o)uspto.gov or phone 703-306-41 19 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3,0 PROGRAM , ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE &ELOW: 



Checker Version 3.0 

The Checker Version 3 .0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July 1. 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST 25 

Checker Version 3 0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO) 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
tistirigs, thfcs saving time and money. 

Checker Version 3.0 can be down loaded from the USPTO website at the following address: 

http://wwW.uspto.gov/web/offices/pac/checker 



• # 



Raw Sequence Lb ting Error Summary 



ERROR DETECTED 



SUGGESTED CORRECTION 



SERIAL NUMBER: 



ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 



1 Wrapped Nucleics 

Wrapped Aminos 



The number/text at the end of each line "wrapped" down to the next line. This may occur if your file 
was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will 
prevent "wrapping," 



Invalid Line Length The rules require that a line not exceed 72 characters in length, this includes white spaces. 



3 Misaligned Amino 

Numbering 

A Non-ASCII 



Variable Length 



6 Patentln 2.0 

"bug** 



7 Skipped Sequences 

(OLD RULES) 



8 Skipped Sequences 

(NEW RULES) 



9 Use of n's or Xaa's 
(NEW RULES) 



10 Invalid <213> 

Response 

11 Useof<220> 



12 Patentln 2.0 

"bug" 



The numbering under each 5* amino acid is misaligned Do not use tab codes between numbers; 
use space characters, instead. 

The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission b saved In ASCII text. 

Sequences) contain n's or Xaa's representing more than one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

A "bug" in Patentln version 2.0 has caused uTe <220>-<223> section to be missing from amino acid 

sequencers) . Normally, Patentln would automatically generate this section from the 

previously coded nucleic acid sequence Please manually copy th« relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 



Sequencers) 



missing. If intentional, please insert the following lines for each skipped sequence: 



(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 

(xi) SEQUENCE DESCRIPTION:SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 



Sequencers) _ 



missing. If intentional, please insert the following lines for each skipped sequence. 



<210> sequence id number 
<400> sequence id number 

000 

Use of n's and/or Xaa's have been detected in the Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents 

Per 1.823 of Sequence Rules, the only valid <2 13> responses are: Unknown, Artificial Sequence, or 
scientific name (Genus/spocies). <220>-<223> section is required when <2 13> response is Unknown or 
is Artificial Sequence 



Sequences) 



missing the <220> "Feature" and associated numeric identifiers and responses. 



Use of <220> to <223> is MANDATORY if <2 1 3> "Organism" response is "Artificial Sequence" or 
"Unknown/' Please explain source of genetic material in <220> to <223> section. 
(See "Federal Regrster," 06/01/1998, Vol. 63, No 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 



13 • Misuse of n n can only be used to represent a single nucleotide in a nucleic acid sequence. N is not used to represent 

any value not specifically a nucleotide. 

AMC/MH - Biotechnology Systems Branch - 08/21/2001 
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PCT09 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/0 9/7 4 3,24 7 



DATE: 08/03/2001 
TIME: 16:29:03 



Input Set : A:\ES.txt 

Output Set: N:\CRF3\08032001\I743247.raw 



<110> 
<120> 



APPLICANT: Sagami Chemical Research Center; Protegene Inc. 

TITLE OF INVENTION: Human Proteins Having Hydrophobic Domains And DNAs Encoding 



These 



Proteins 



8 


<130> 


FILE 


REFERENCE: 661102 








<140> 


CURRENT APPLICATION NUMBER 


: US/09/743, 247 


-> 10 


<141> 


CURRENT FILING DATE: 1999- 


07-22 




10 


<150> 


PRIOR 


APPLICATION 


NUMBER: 


JP 10- 


208820 


11 


<151> 


PRIOR 


FILING DATE 


: 1998-07 


-24 




13 


<150> 


PRIOR 


APPLICATION 


NUMBER: 


JP 10- 


224105 


14 


<151> 


PRIOR 


FILING DATE 


: 1998-08 


-07 




16 


<150> 


PRIOR 


APPLICATION 


NUMBER: 


JP 10- 


238116 


17 


<151> 


PRIOR 


FILING DATE 


: 1998-08 


-25 




19 


<150> 


PRIOR 


APPLICATION 


NUMBER: 


JP 10- 


254736 


20 


<151> 


PRIOR 


FILING DATE 


: 1998-09 


-09 




2 2 


<150> 


PRIOR 


APPLICATION 


NUMBER: 


JP 10- 


275505 


23 


<151> 


PRIOR 


FILING DATE 


: 1998-09 


-29 




25 


<160> 


NUMBER OF SEQ ID NOS : 150 






27 


<170> 


SOFTWARE: Windows 


9 5 (Word 


98) 





4+ 

r 



6 



ERRORED SEQUENCES 



519 


<210> SEQ ID NO 


: 21 


























520 


<211> LENGTH :Cb 


10) < 




U( 


' i 


) 


















521 


<212> TYPE: 


DN?T 


























522 


<213> ORGANISM: 


Homo sapiens 




















> 523 


<220> FEATURE: 




























524 


<221> NAME/KEY: 


CDS 


























525 


<222> LOCATION: 


(66) 




(443 ) 






















527 


<400> SEQUENCE: 


21 


























528 


acgcttgatc 


cccggccgcg gggccaggaa gtcggagttt 


gagccccgga 


ggcagagegg 


60 


529 


ctgcc atg gcc aag tac ctg gcc cag ate att gtg atg ggc gtg cag gtg 


110 


530 




Met Ala Lys Tyr Leu Ala Gin lie lie Val Met Gly Val Gin Val 




531 






1 








5 








10 








15 




532 


gtg 


ggc 


agg 


gcc 


ttt 


gca 


egg 


gcc 


ttg 


egg 


cag 


gag 


ttt 


gca 


gcc 


age 


158 


533 


Val 


Gly 


Arg 


Ala 


Phe 


Ala 


Arg 


Ala 


Leu 


Arg 


Gin 


Glu 


Phe 


Ala 


Ala 


Ser 




534 










20 










25 










30 






535 


egg 


gcc 


gca 


get 


gat 


gcc 


cga 


gga 


cgc 


get 


gga 


cac 


egg 


tct 


gca 


gcc 


206 


536 


Arg 


Ala 


Ala 


Ala 


Asp 


Ala 


Arg 


Gly 


Arg 


Ala 


Gly 


His 


Arg 


Ser 


Ala 


Ala 




537 








35 










40 










45 








538 


get 


tec 


aac 


etc 


tec 


ggc 


etc 


age 


etc 


cag 


gag 


gca 


cag 


cag 


att 


etc 


254 


539 


Ala 


Ser 


Asn 


Leu 


Ser 


Gly 


Leu 


Ser 


Leu 


Gin 


Glu 


Ala 


Gin 


Gin 


He 


Leu 




540 






50 










55 










60 










541 


aac 


gtg 


tec 


aag 


ctg 


age 


cct 


gag 


gag 


gtc 


cag 


aag 


aac 


tat 


gaa 


cac 


302 


542 


Asn 


Val 


Ser 


Lys 


Leu 


Ser 


Pro 


Glu 


Glu 


Val 


Gin 


Lys 


Asn 


Tyr 


Glu 


His 




543 




65 










70 










75 












544 


tta 


ttt 


aag 


gtg 


aat 


gat 


aaa 


tec 


gtg 


ggt 


ggc 


tec 


tte 


tac 


ctg 


cag 


350 
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RAW SEQUENCE LISTING DATE: 08/03/2001 

PATENT APPLICATION: US/09/74 3,24 7 TIME: 16:29:03 

Input Set : A:\ES.txt 

Output Set: N:\CRF3\08032001\I743247.raw 

545 Leu Phe Lys Val Asn Asp Lys Ser Val Gly Gly Ser Phe Tyr Leu Gin 

546 80 85 90 95 

547 tea aag gtg gtc cgc gca aag gag cgc ctg gat gag gaa etc aaa ate 398 

548 Ser Lys Val Val Arg Ala Lys Glu Arg Leu Asp Glu Glu Leu Lys lie 

549 100 105 110 , ^ 
E--> 550 cag gec cag gag gac aga gaa aaa ggg cag atg ccc cat acg tgactgetc /4 50 rjll 

551 Gin Ala Gin Glu Asp Arg Glu Lys Gly Gin Met Pro His Thr ^ ' 

552 115 120 125 /"Ov 
E--> 553 gctccccccg cccaccccgc cgcctctaat ttatagcttg gtaataaatt tettttctge 

3553 <210> SEQ ID NO: 93 

3554 <211> LENGTH : 195 

3555 <212> TYPE: PRT 

3556 < 213> ORGANISM: Homo (sapience 
3558 <400> SEQUENCE : 93 



E--> 



3559 


Met 


Arg 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Val 


Ala 


Ala 


Ser 


Ala 


Met 


Val 


Arg 


3560 


1 








5 










10 










15 




3561 


Ser 


Glu 


Ala 


Ser 


Ala 


Asn 


Leu 


Gly 


Gly 


Val 


Pro 


Ser 


Lys 


Arg 


Leu 


Lys 


3562 








20 










25 










30 






3563 


Met^ 


Gin 


Tyr 


Ala 


Thr 


Gly 


Pro 


Leu 


Leu 


Lys 


Phe 


Gin 


He 


Cys 


Val 


Ser 


3564/ 






35 










40 










45 








3565 


xaa 


iGly 


Tyr 


Arg 


Arg 


Val 


Phe 


Glu 


Glu 


Tyr 


Met 


Arg 


Val 


He 


Ser 


Gin 


356^ 




' 50 










55 










60 










3567 


Arg 


Tyr 


Pro 


Asp 


He 


Arg 


He 


Glu 


Gly 


Glu 


Asn 


Tyr 


Leu 


Pro 


Gin 


Pro 


3568 


65 










70 










75 










80 


3569 


He 


Tyr 


Arg 


His 


He 


Ala 


Ser 


Phe 


Leu 


Ser 


Val 


Phe 


Lys 


Leu 


Val 


Leu 


3570 










85 










90 










95 




3571 


He 


Gly 


Leu 


He 


He 


Val 


Gly 


Lys 


Asp 


Pro 


Phe 


Ala 


Phe 


Phe 


Gly 


Met 


3572 








100 










105 










110 






3573 


Gin 


Ala 


Pro 


Ser 


He 


Trp 


Gin 


Trp 


Gly 


Gin 


Glu 


Asn 


Lys 


Val 


Tyr 


Ala 


3574 






115 










120 










125 








3575 


Cys 


Met 


Met 


Val 


Phe 


Phe 


Leu 


Ser 


Asn 


Met 


He 


Glu 


Asn 


Gin 


Cys 


Met 


3576 




130 










135 










140 










3577 


Ser 


Thr 


Gly 


Ala 


Phe 


Glu 


He 


Thr 


Leu 


Asn 


Asp 


Val 


Pro 


Val 


Trp 


Ser 


3578 


145 










150 










155 










160 


3579 


Lys 


Leu 


Glu 


Ser 


Gly 


His 


Leu 


Pro 


Ser 


Met 


Gin 


Gin 


Leu 


Val 


Gin 


He 


3580 










165 










170 










175 




3581 


Leu 


Asp 


Asn 


Glu 


Met 


Lys 


Leu 


Asn 


Val 


His 


Met 


Asp 


Ser 


He 


Pro 


His 



3582 180 185 190 

3583 His Arg Ser 

3584 195 

4525 <210> SEQ ID NO: 116 

4526 <211> LENGTH: 1357 

4527 <212> TYPE: DNA 

4528 <213> ORGANISM: Home 




V^Jt> 4529 <220> FEATURE: \ } 



4530 <221> NAME/KEY 

4531 <222> LOCATION 
4533 <400> SEQUENCE 



CDS 

(81) . . . (1262) 
116 



(510; S^i 



c/y\ £w ^ 



>V-W-rf^ 1 



4534 cgtgcgtttg tggccgtccg gcctccctga catgcagccc tctggacccc gaggttggac 60 
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RAW SEQUENCE LISTING DATE: 08/03/2001 

PATENT APPLICATION: US/0 9/74 3,24 7 TIME: 16:29:04 



Input Set : A:\ES.txt 

Output Set: N:\CRF3\08032001\I743247.raw 



4 5 3 5 


cctactgtga 


cacacctacc atg egg aca etc ttc aac etc etc tgg ctt 


111) 


4 5 36 












Met Arg Thr Leu Phe Asn Leu Leu Trp Leu 




4537 














I 








D 








10 




4 5 38 


qcc 


c tg 


gec 


tgc 


age 


cct 


gtt 


cac 


act 


acc 


ctg 


tea 


aag 


tea 


ga r 


gec 


1 c: o 

Ijo 


4 5 3 9 


Ala 


Leu 


Ala 


Cys 


Ser 


Pro 


vai 


HIS 


i nr 


T Ui v- 

l nr 


Leu 


C r 


Lys 


Ser 


Asp 


Ala 




4 54 0 










I D 










z U 










2 5 






E- - > 4 541 


aaa 


aaa 


gec 


gec 


tea 


aag 


acg 


ctg 


ctg 


gag 


aag 


agt 


cag 


ttt 


tea 


gat 


/ 2 0 3 


4 5 4 2 


Lys 


Lys 


Ala 


Ala 


Ser 


Lys 


Thr 


Leu 


Leu 


GlU 


Lys 


Ser 


Gin 


Phe 


Ser 


Asp 


V - 


454 3 








3 (J 










3 D 










4 U 








4 54 4 


aag 


ccg 


gtg 


caa 


gac 


egg 


ggt 


ttg 


gtg 


gtg 


acg 


gac 


etc 


aaa 


get 


gag 


2 5 4 


4 54 5 


Lys 


Pro 


Val 


Gin 


Asp 


Arg 


Gly 


Leu 


vai 


vai 


Thr 


Asp 


Leu 


Lys 


Ala 


Glu 




4 546 






4 D 










5 0 










5 5 










4 54 7 


agt 


gtg 


gtt 


ctt 


gag 


cat 


cgc 


age 


tac 


tgc 


teg 


gca 


aag 


gec 


egg 


gac 


3 0 2 


4 54 8 


Ser 


Val 


Val 


Leu 


Glu 


His 


Arg 


Ser 


Tyr 


Cys 


Ser 


Ala 


Lys 


Ala 


Arg 


Asp 




4549 




60 










6 5 










70 












4 5 50 


aga 


cac 


ttt 


get 


ggg 


gat 


gta 


ctg 


ggc 


tat 


gtc 


act 


cca 


tgg 


aac 


age 


3 5 0 


4 551 


Arg 


His 


Phe 


Ala 


Gly 


Asp 


Val 


Leu 


Gly 


Tyr 


val 


Thr 


Pro 


Trp 


Asn 


Ser 




4 552 


*7 cr 










80 










O cr 
O D 










90 




4 55 3 


cat 


ggc 


tac 


gat 


gtc 


acc 


aag 


gtc 


ttt 


ggg 


age 


aag 


ttc 


aca 


cag 


ate 


3 9 8 


4 554 


His 


Gly 


Tyr 


Asp 


Val 


Thr 


Lys 


Val 


Phe 


Gly 


Ser 


Lys 


Phe 


Thr 


Gin 


He 




4 555 










9 D 










100 










10 5 






4 556 


tea 


ccc 


gtc 


tgg 


ctg 


cag 


ctg 


aag 


aga 


cgt 


ggc 


cgt 


gag 


atg 


ttt 


gag 


4 4 6 


4 557 


Ser 


Pro 


Va 1 


Trp 


Leu 


Gin 


Leu 


Lys 


Arg 


Arg 


Gly 


Arg 


Glu 


Met 


Phe 


Glu 




4558 








110 










115 










120 








4 559 


gtc 


acg 


ggc 


etc 


cac 


gac 


gtg 


gac 


caa 


ggg 


tgg 


atg 


cga 


get 


gtc 


agg 


4 94 


4560 


Val 


Thr 


Gly 


Leu 


His 


Asp 


Va 1 


Asp 


Gin 


Gly 


Trp 


Met 


Arg 


Ala 


Val 


Arg 




4561 






12 5 










130 










135 










4 562 


aag 


cat 


gec 


aag 


ggc 


ctg 


cac 


ata 


gtg 


cct 


egg 


etc 


ctg 


ttt 


gag 


gac 


54 2 


4 563 


Lys 


His 


Ala 


Lys 


Gly 


Leu 


His 


lie 


Val 


Pro 


Arg 


Leu 


Leu 


Pne 


Glu 


Asp 




4564 




14 0 










145 










150 












4 56 5 


tgg 


act 


tac 


gat 


gat 


ttc 


egg 


aac 


gtc 


tta 


gac 


agt 


gag 


gat 


gag 


ata 


590 


4566 


Trp 


Thr 


Tyr 


Asp 


Asp 


Phe 


Arg 


Asn 


Va l 


Leu 


Asp 


Ser 


Glu 


Asp 


Glu 


He 




4 567 


155 










160 










165 










170 




4 568 


gag 


gag 


ctg 


age 


aag 


acc 


gtg 


gtc 


cag 


gtg 


gca 


aag 


aac 


cag 


cat 


ttc 


63 8 


4 569 


Glu 


Glu 


Leu 


Ser 


Lys 


Thr 


Va 1 


va 1 


Gin 


va 1 


Ala 


Lys 


Asn 


Gin 


His 


Phe 




4570 










1 "7 CT 
1 / D 










180 










185 






4 571 


gat 


ggc 


ttc 


gtg 


gtg 


gag 


gtc 


tgg 


aac 


cag 


ctg 


eta 


age 


cag 


aag 


cgc 


6 8 6 


4572 


Asp 


Gly 


Phe 


Val 


Val 


Glu 


Val 


Trp 


Asn 


Gin 


Leu 


Leu 


Ser 


Gin 


Lys 


Arg 




4573 








190 










195 










200 








4574 


gtg 


ggc 


etc 


ate 


cac 


atg 


etc 


acc 


cac 


ttg 


gec 


gag 


get 


ctg 


cac 


cag 


734 


4575 


Val 


Gly 


Leu 


lie 


His 


Met 


Leu 


Thr 


His 


Leu 


Ala 


Glu 


Ala 


Leu 


His 


Gin 




4576 






205 










210 










215 










4577 


gec 


egg 


ctg 


ctg 


gec 


etc 


ctg 


gtc 


ate 


ccg 


cct 


gee 


ate 


acc 


ccc 


ggg 


782 


4578 


Ala 


Arg 


Leu 


Leu 


Ala 


Leu 


Leu 


Val 


He 


Pro 


Pro 


Ala 


He 


Thr 


Pro 


Gly 




4579 




220 










225 










230 












4580 


acc 


gac 


cag 


ctg 


ggc 


atg 


ttc 


acg 


cac 


aag 


gag 


ttt 


gag 


cag 


ctg 


gee 


830 


4581 


Thr 


Asp 


Gin 


Leu 


Gly 


Met 


Phe 


Thr 


His 


Lys 


Glu 


Phe 


Glu 


Gin 


Leu 


Ala 




4 582 


235 










240 










245 










250 




4583 


ccc 


gtg 


ctg 


gat 


ggt 


ttc 


age 


etc 


atg 


acc 


tac 


gac 


tac 


tct 


aca 


gcg 


878 
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RAW SEQUENCE LISTING DATE: 08/03/2001 

PATENT APPLICATION: US/09/74 3 , 24 7 TIME: 16:29:04 



Input Set : A:\ES.txt 

Output Set: N:\CRF3\08032001\I743247.raw 



1 cz Q n 


Pro 


V 3 1 


Leu 


Asp 


<j ly 


D V-i ("i 


Ser 


Leu 


Met 


i nr 


Tyr 


Asp 


ly r 


Ser 


i nr 


Ala 




« r O r 
4 3 O 3 










n r c 
Z 3 3 










3 £ n 
Z 0 U 










1 £ C 

ZD J 






4 C Q /" 

4 Dot) 


CdL 


cag 


c c t 


ggc 


cc t 


=> => -t- 


g c a 


cc c 


ctg 


tec 


tgg 


gtt 


cga 


gee 


tgc 


gtc 


y z d 


A C O 1 

4 DO / 


His 


bin 


Pro 


G 1 y 


Pro 


Asn 


A 1 -a 

Ala 


Pro 


Leu 


Ser 


Trp 


Val 


Arg 


A 1 -a 

Ala 


Cys 


Va l 




1 c o o 
4 3 o o 








1 1 n 
Z 1 U 










Z 1 3 










o q n 
zoU 








4 j o y 


cag 


gtc 


ctg 


gac 


ccg 


aag 


tec 


aag 


tgg 


cga 


age 


aaa 


a l c 


etc 


ctg 


ggg 


Q 7 1 

y / 4 


4 d y u 


Gin 


val 


Leu 


Asp 


Pro 


Lys 


Ser 


Lys 


Trp 


Arg 


Ser 


L y s 


T 1 A 

lie 


Leu 


Leu 


(j ly 




4 5 91 






O o tz 
2 O 3 










inn 

z y u 










one; 

z y 3 










ICQ) 

4 3 y z 


etc 


aac 


LLC 


LdL 


ggt 


acg 


gac 


Lac 


gcg 


acc 


tec 


aag 


ga l 


g cc 


eg t 


gag 


i n 3 o 
1 u z z 


4 5 9 3 


Leu 


Asn 


Phe 


Tyr 


Gly 


Met 


Asp 


Tyr 


Ala 


Thr 


Ser 


Lys 


Asp 


Ala 


Arg 


Glu 




4 5 94 




300 










3 U 3 










Tin 
310 












1 c; Q R 

4 d y d 


cc t 


gtt 


gtc 


ggg 


gec 


agg 


Lac 


aLc 


cag 


aca 


ctg 


aag 


gac 


cac 


agg 


ccc 


1 U / u 


4 d y b 


Pro 


va l 


Va 1 


Gly 


Ala 


Arg 


Tyr 


lie 


Gin 


Thr 


Leu 


Lys 


Asp 


His 


Arg 


Pro 




4 597 


3 1 3 










3 2 0 










o o c 
3 2 3 










3 3 0 




4 3 y o 


egg 


atg 


gtg 


tgg 


gac 


age 


cag 


gee 


tea 


gag 


cac 


ttc 


ttc 


gag 


tac 


aag 


1 1 1 Q 

1 1 1 O 


,1 c n n 

4 3 y y 


Arg 


Met 


Val 


Trp 


Asp 


Ser 


Gin 


Ala 


Ser 


Glu 


His 


Phe 


Phe 


Glu 


Tyr 


Lys 




4 £T A A 

4 60 0 










335 










340 










345 






4601 


aag 


age 


cgc 


agt 


ggg 


agg 


cac 


gtc 


gtc 


ttc 


tac 


cca 


acc 


ctg 


aag 


tec 


1166 


4 602 


Lys 


Ser 


Arg 


Ser 


Gly 


Arg 


His 


Val 


Val 


Phe 


Tyr 


Pro 


Thr 


Leu 


Lys 


Ser 




4 b U J 








350 










355 










360 








4 60 4 


ctg 


cag 


gtg 


egg 


ctg 


gag 


ctg 


gee 


egg 


gag 


ctg 


ggc 


gtt 


ggg 


gtc 


tct 


1214 


A c r\ CL 
4 b(J 3 


Leu 


Gin 


Val 


Arg 


Leu 


Glu 


Leu 


Ala 


Arg 


Glu 


Leu 


Gly 


Val 


Gly 


Val 


Ser 




vi c: n c 
4 0 U 0 






365 










370 










375 










A c n "7 

4 O U / 


ate 


tgg 


gag 


ctg 


ggc 


cag 


ggc 


ctg 


gac 


tac 


ttc 


tac 


gac 


ctg 


etc 


t 


i 3 a c\ 
IzbU 


4 o U o 


He 


Trp 


Glu 


Leu 


Gly 


Gin 


Gly 


Leu 


Asp 


Tyr 


Phe 


Tyr 


Asp 


Leu 


Leu 






4 609 




380 










385 










390 












4 blU 


aggtgggcat tgcggcctcc gcggtggacg tgttcttttc 


taagccatgg agtgagtgag 


i o o n 
13 2 0 


,1 C1 1 

4 O 1 1 


caggtgtgaa atacaggect ccactccgtt tgctgtg 












i t r i 

13 3/ 


5735 


<210> SEQ ID NO 


144 
























5736 


<211> LENGTH: 619 


























3 / 6 / 


<212> TYPE: 


DNA 




























57 3 8 


<213> ORGANISM: 


Homo (sapience 


















W \Z > 573 9 


<220> FEATURE: 


























^ r n a n 
3 / 4 U 


<221> NAME/KEY: 


CDS 


























C 1 A ~\ 

3/41 


<222> LOCATION: 


(13) 


. . . (333) 






















574 3 


<400> SEQUENCE: 


144 


























R~7 A A 

3/44 


cttcgactcg ct atg tec act aac aat atg teg gac cca egg agg ccg 


A Q 
4 O 


5745 








Met Ser Thr Asn Asn Met Ser Asp Pro Arg Arg Pro 




5746 










1 








5 








10 








5747 


aac 


aaa 


gtg 


ctg 


agg 


tac 


aag 


ccc 


ccg 


ccg 


age 


gaa 


tgt 


aac 


ccg 


gec 


96 


5748 


Asn 


Lys 


Val 


Leu 


Arg 


Tyr 


Lys 


Pro 


Pro 


Pro 


Ser 


Glu 


Cys 


Asn 


Pro 


Ala 




5749 






15 










20 










25 










5750 


ttg 


gac 


gac 


ccg 


acg 


ccg 


gac 


tac 


atg 


aac 


ctg 


ctg 


ggc 


atg 


ate 


ttc 


144 


5751 


Leu 


Asp 


Asp 


Pro 


Thr 


Pro 


Asp 


Tyr 


Met 


Asn 


Leu 


Leu 


Gly 


Met 


He 


Phe 




5752 




30 










35 










40 












5753 


age 


atg 


tgc 


ggc 


etc 


atg 


ctt 


aag 


ctg 


aag 


tgg 


tgt 


get 


tgg 


gtc 


get 


192 


5754 


Ser 


Met 


Cys 


Gly 


Leu 


Met 


Leu 


Lys 


Leu 


Lys 


Trp 


Cys 


Ala 


Trp 


Val 


Ala 




5755 


45 










50 










55 










60 




5756 


gtc 


tac 


tgc 


tec 


ttc 


ate 


age 


ttt 


gee 


aac 


tct 


egg 


age 


teg 


gag 


gac 


240 
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RAW SEQUENCE LISTING DATE: 

PATENT APPLICATION: US/0 9/74 3,24 7 TIME: 

Input Set : A:\ES.txt 

Output Set: N:\CRF3\08032001\I743247.raw 



08/03/2001 
16:29:04 



E--> 



5757 
5758 
5759 
5760 
5761 
5762 
5763 
5764 
5765 
5766 
5767 
5768 
5769 
5976 
5977 
5978 
5979 
b> 5980 
5981 
5982 
5984 
5985 
5986 
5987 
5988 
5989 
5990 
5991 
5992 
5993 
5994 
5995 
5996 
5997 
5998 
5999 
6000 
6001 
6002 
6003 
6004 
6005 
6006 
6007 
6008 
6009 
6010 
6011 
6012 



Val Tyr Cys Ser Phe lie Ser Phe Ala Asn Ser 

65 70 
acg aag caa atg atg agt age ttc atg ctg tec 
Thr Lys Gin Met Met Ser Ser Phe Met Leu Ser 

80 85 
atg tec tat ctg cag aat cct cag ccc atg acg 
Met Ser Tyr Leu Gin Asn Pro Gin Pro Met Thr 

95 100 
tgataccagc ctagaagggt cacattttgg accctgtcta 
getgetaaac ctgctgcctt cagctgccat cctggacttc 
tgcccccagc tggatagagg gaacctggcc ctttcctagg 
tcctgcctcc cttcccctgc ctgctgctgg gggagatget 
teatttgett tctcgttgaa acctgttgtt aataaagttt 
<210> SEQ ID NO: 149 
<211> LENGTH: 2182 
<212> TYPE: DNA 
<213> ORGANISM: Homo | 
<2 20> FEATURE: 
<221> NAME/KEY: CDS 
<222> LOCATION: (56) 
<400> SEQUENCE: 149 

gcacttcagc ttcccctccc ccggcgccct ctggggctcc 

atg ttc acc age ace ggc tec agt ggg etc tac 
Met Phe Thr Ser Thr Gly Ser Ser Gly Leu Tyr 

15 10 
aag age ctt ctg ctg gtc ccc agt gee etc tec 
Lys Ser Leu Leu Leu Val Pro Ser Ala Leu Ser 



Arg Ser Ser Glu Asp 

75 

ate tct gec gtg gtg 
He Ser Ala Val Val 
90 

ccc cca tgg 

Pro Pro Trp 
105 

tccactaggc ctgggctttg 
cctgaatgag gccgtctcgg 
gaacacccta ggcttacccc 
gtccatgttt ctaggggtat 
ttcactcag 




. . ( 1090 ) 



20 



25 



etc ctg cct cac tgc cag aag etc ttt gtg tat 
Leu Leu Pro His Cys Gin Lys Leu Phe Val Tyr 

35 40 
aag aac gac ttc cag att tgg agg ttg ata tgt 
Lys Asn Asp Phe Gin He Trp Arg Leu He Cys 

50 55 
ctt gat ttg aaa gat act ttc tgc agt agt ctg 
Leu Asp Leu Lys Asp Thr Phe Cys Ser Ser Leu 
65 70 75 

agg ata ttt gaa aga aga tat gga age aga aaa 
Arg He Phe Glu Arg Arg Tyr Gly Ser Arg Lys 

85 90 
ctg ggt tec tgg gtt ttg tea gec tta ttt gac 
Leu Gly Ser Trp Val Leu Ser Ala Leu Phe Asp 

100 105 
get atg cag tat ttc ttt ggc ate act gca get 
Ala Met Gin Tyr Phe Phe Gly He Thr Ala Ala 

115 120 
gqa ttc ctg gca cct gtg ttt get ctg ttt gta 
Gly Phe Leu Ala Pro Val Phe Ala Leu Phe Val 
130 135 



gagcccggcg ggacc 

aag gcg cct ctg teg 
Lys Ala Pro Leu Ser 
15 

etc ctg etc gec etc 
Leu Leu Leu Ala Leu 
30 

gac ctt cac gca gtc 
Asp Leu His Ala Val 
45 

gga aga ata att tgc 
Gly Arg He He Cys 
60 

ctt att tat aat ttt 
Leu He Tyr Asn Phe 
80 

ttt gca tec ttt ttg 
Phe Ala Ser Phe Leu 
95 

ttt etc etc att gaa 
Phe Leu Leu He Glu 
110 

agt aat ttg cct tct 
Ser Asn Leu Pro Ser 
125 

cca ttt tac tgc tec 
Pro Phe Tyr Cys Ser 
140 



340 ) >^ 



390 
4 50 
510 
570 
619 



(58/ 

HT3 



151 



199 



247 



295 



343 



391 



4 39 



487 
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RAW SEQUENCE LISTING DATE: 08/03/2001 

PATENT APPLICATION: US/09/74 3,24 7 TIME: 16:29:05 



Input Set : A:\ES.txt 

Output Set: N:\CRF3\08032001\l743247.raw 



6013 


ata 


cca 


aga 


gtc 


caa 


gtg 


gca 


caa 


att 


ctg 


ggt 


ccg 


ttg 


tec 


ate 


aca 


5 3 5 


6 014 


lie 


Pro 


Arg 


Val 


Gin 


Val 


Ala 


Gin 


lie 


Leu 


Gly 


Pro 


Leu 


Ser 


lie 


Thr 




6015 


14 5 










150 










15 5 










Ibu 




6016 


aac 


aag 


aca 


ttg 


att 


tat 


ata 


ttg 


gga 


ctg 


cag 


ctt 


ttc 


acc 


tct 


ggt 


C Q -J 

DO 3 


6017 


Asn 


Lys 


Thr 


Leu 


I le 


Tyr 


lie 


Leu 


Gly 


Leu 


Gin 


Leu 


Phe 


Thr 


Ser 


Gly 




6018 










16 5 










170 










17 5 






6 019 


tec 


tac 


ate 


tgg 


att 


gta 


gee 


ata 


agt 


gga 


ctt 


atg 


tec 


ggt 


ctg 


tgc 


b 3 1 


6020 


Ser 


Tyr 


lie 


Trp 


lie 


Val 


Ala 


lie 


Ser 


Gly 


Leu 


Met 


Ser 


Gly 


Leu 


Cys 




60 21 








180 










185 










190 








60 2 2 


tac 


gac 


age 


aaa 


atg 


ttc 


cag 


gtg 


cat 


cag 


gtg 


etc 


tgc 


ate 


ccc 


age 


d / y 


6023 


Tyr 


Asp 


Ser 


Lys 


Met 


Phe 


Gin 


Val 


His 


Gin 


Va 1 


Leu 


Cys 


lie 


Pro 


Ser 




60 24 






195 










200 










205 










60 2 5 


tgg 


atg 


gca 


aaa 


ttc 


ttt 


tct 


tgg 


aca 


ctt 


gaa 


ccc 


ate 


ttc 


tct 


tct 


7 2 7 


6 0 2 6 


Trp 


Met 


Ala 


Lys 


Phe 


Phe 


Ser 


Trp 


Thr 


Leu 


Glu 


Pro 


lie 


Phe 


Ser 


Ser 




6027 




210 










215 










2 20 












6028 


tea 


gaa 


ccc 


acc 


age 


gaa 


gee 


aga 


att 


ggg 


atg 


gga 


gee 


acg 


ctg 


gac 


7 7 5 


60 2 9 


Ser 


Glu 


Pro 


Thr 


Ser 


Glu 


Ala 


Arg 


lie 


Gly 


Met 


Gly 


Ala 


Thr 


Leu 


Asp 




6030 


225 










230 










235 










240 




60 31 


ate 


cag 


aga 


cag 


cag 


aga 


atg 


gag 


ctg 


ctg 


gac 


egg 


cag 


ctg 


atg 


ttc 


o 2 3 


6032 


lie 


Gin 


Arg 


Gin 


Gin 


Arg 


Met 


Glu 


Leu 


Leu 


Asp 


Arg 


Gin 


Leu 


Met 


Phe 




6033 










245 










250 










255 






60 34 


tct 


cag 


ttt 


gca 


caa 


ggg 


agg 


cga 


cag 


aga 


cag 


cag 


cag 


gga 


gga 


atg 


87 1 


6035 


Ser 


Gin 


Phe 


Ala 


Gin 


Gly 


Arg 


Arg 


Gin 


Arg 


Gin 


Gin 


Gin 


Gly 


Gly 


Met 




6036 








260 










26 5 










270 








60 37 


ate 


aat 


tgg 


aat 


cgt 


ctt 


ttt 


cct 


cct 


tta 


cgt 


cag 


cga 


caa 


aac 


gta 


919 


6038 


lie 


Asn 


Trp 


Asn 


Arg 


Leu 


Phe 


Pro 


Pro 


Leu 


Arg 


Gin 


Arg 


Gin 


Asn 


Val 




6039 






275 










280 










285 










604 0 


aac 


tat 


cag 


ggc 


ggt 


egg 


cag 


tct 


gag 


cca 


gca 


gcg 


ccc 


cct 


eta 


gaa 


Ci C 1 


6041 


Asn 


Tyr 


Gin 


Gly 


Gly 


Arg 


Gin 


Ser 


Glu 


Pro 


Ala 


Ala 


Pro 


Pro 


Leu 


Glu 




604 2 




290 










295 










300 












604 3 


gtt 


tct 


gag 


gaa 


cag 


gtc 


gec 


egg 


etc 


atg 


gag 


atg 


gga 


ttt 


tec 


aga 


i a i r 
ID ID 


604 4 


Val 


Ser 


Glu 


Glu 


Gin 


Val 


Ala 


Arg 


Leu 


Met 


Glu 


Met 


Gly 


Phe 


Ser 


Arg 




604 5 


305 










310 










315 










320 




604 6 


ggt 


gat 


get 


ttg 


gaa 


gec 


ctg 


aga 


get 


tea 


aac 


aat 


gac 


etc 


aat 


gtc 


106 3 


604 7 


Gly 


Asp 


Ala 


Leu 


Glu 


Ala 


Leu 


Arg 


Ala 


Ser 


Asn 


Asn 


Asp 


Leu 


Asn 


Val 




604 8 










325 










330 










335 






604 9 


gec 


acc 


aac 


ttc 


ctg 


ctg 


cag 


cac 


tgatagtccc aggccaacac tgg 




1110 


6050 


Ala 


Thr 


Asn 


Phe 


Leu 


Leu 


Gin 


His 




















6051 








340 




























6052 


gaccggaccg geagecgagt gacagtgcgt 


ggtccccacc 


atcagatcag cccggggacc 


1170 


6053 


gagcatctct ggtgctgatg ttcttgtggg 


aagagggagg 


ttccaccgca cccctgccct 


1230 


6054 


caaccgcaag actgttgccg ttttagtgtg 


gagataagtt 


tgecattaca ttagcatgta 


1290 


6055 


ttttctatct atatttttta ttgggcattt 


tccctaggtt 


ggagagtcag cactcgtttt 


1350 


6056 


gaatgtgttt aaaatgeatt aaaatggaag 


atttctgeag 


gcagttgaat ggcactccag 


1410 


6057 


atggggaatt gctgtaaccc tcttactgta 


acatgtcatc 


tcctgcgtcg tgatggggag 


1470 


6058 


agggtaatgt tacttcacaa aggacatgtc 


agatccttct 


tcatggactt ttttagttac 


1530 


6059 


tgttttttct ctcaaacttg ttttcgaatc 


tcctgggagt 


gagggagaaa cagggagctg 


1590 


6060 


aatcctcccc caagctgttc caggecagag 


gaetctgeag 


taccttctcc tacatctagt 


1650 


6061 


aacaaagaat ggtgataacc atgcactggt 


tcaaggttct 


ggagttctcc atgaaacttg 


1710 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/0 9/74 3,24 7 



DATE: 08/03/2001 
TIME: 16:29:05 



Input Set 
Output Set 



A:\ES.txt 

N:\CRF3\080 3 2001\I74 3 24 7. raw 



6062 ggttaatttt gctcagagta tccggagtta gccactaggc tgcgggtgaa atgggatgga 1770 

6063 gtagaacaac agcaggcttc ctggagccac atgggctgac tagggcactc tgtggctggc 1830 

6064 ctggcacggg ctcagcccag gaagaggaga aacgatccct tgcctgcccc tccctgtggc 1890 

6065 agggctaact gcctggccct cctggctcgc agccagccag ccccctggca gcaggttctc 1950 

6066 ctcagggctt gggtcttcaa cctgtggcga caggaggcag ggcagactgt ggaggacagg 2010 

6067 atgcaggtca gggagaggga aggcaggggt ggaccgccat gagcatgaaa agacccgaag 2070 

6068 caagttgact cttgcaatgt gcaactgtta tgttctgcaa aatgagcaac gatgtatcaa 2130 

6069 attgatgcaa atttagatgt tgatacttac aataaagttt ttaatgtgtt tt 2182 
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VERIFICATION SUMMARY DATE: 08/0 3/2001 

PATENT APPLICATION: US/09/74 3,24 7 TIME: 16:29:06 

Input Set : A:\ES.txt 

Output Set: N:\CRF3\08032001\I743247.raw 

L:10 M:270 C: Current Application Number differs, Replaced Current Application No 
L:10 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
L:523 M:283 W: Missing Blank Line separator, v220> field identifier 
L:550 M:254 E: No. of Bases conflict, LENGTH : Input : 4 50 Counted:449 SEQ:21 
M:254 Repeated in SeqNo=21 

L:553 M:252 E: No. of Seq . differs, <211>LENGTH : Input : 510 Found:509 SEQ:21 
L:559 M:283 W: Missing Blank Line separator, <220> field identifier 
L:601 M:283 W: Missing Blank Line separator, <:220> field identifier 
L:674 M:283 W: Missing Blank Line separator, <220> field identifier 
L:740 M:283 W: Missing Blank Line separator, <:220> field identifier 
L:778 M:283 W: Missing Blank Line separator, <:220> field identifier 
L:827 M:283 W: Missing Blank Line separator, <220> field identifier 
L:888 M:112 C: (48) String data converted to lower case, 
L:910 M:283 W: Missing Blank Line separator, <220> field identifier 
L:946 M:283 W: Missing Blank Line separator, <220> field identifier 
L:986 M:283 W: Missing Blank Line separator, <220> field identifier 



L 


1618 


M 


283 


W : 


Missing 


Blank 


Line 


separator, 


<220> 


field 


identifier 


L 


1690 


M 


283 


W : 


Missing 


Blank 


Line 


separator , 


<220> 


field 


identifier 


L 


17 4 9 


M 


2 83 


W : 


Missing 


Blank 


Line 


separator , 


<220> 


field 


identifier 


L 


18 5 3 


M 


2 8 3 


W : 


Missing 


Blank 


Line 


separator , 


<220> 


field 


identifier 


L 


1889 


M 


283 


W: 


Missing 


Blank 


Line 


separator , 


<220> 


field 


identifier 


L 


1942 


M 


283 


W: 


Missing 


Blank 


Line 


separator , 


<220> 


field 


identifier 


L 


2025 


M 


283 


W: 


Missing 


Blank 


Line 


separator, 


<220> 


field 


identifier 


L 


2085 


M 


283 


W: 


Missing 


Blank 


Line 


separator , 


<220> 


field 


identifier 


L 


2121 


M 


283 


W: 


Missing 


Blank 


Line 


separator, 


<220> 


field 


identifier 


L 


2169 


M 


283 


W: 


Missing 


Blank 


Line 


separator , 


<220> 


field 


identifier 


L 


2780 


M 


283 


W: 


Missing 


Blank 


Line 


separator , 


<220> 


field 


identifier 


L 


2858 


M 


283 


W: 


Missing 


Blank 


Line 


separator , 


<220> 


field 


identifier 


L 


2911 


M. 


283 


W: 


Missing 


Blank 


Line 


separator, 


<220> 


field 


identifier 


L 


2997 


M: 


283 


W: 


Missing 


Blank 


Line 


separator, 


<220> 


field 


identifier 


L 


3057 


M: 


283 


W: 


Missing 


Blank 


Line 


separator , 


<220> 


field 


identifier 


L 


3099 


M: 


283 


W: 


Missing 


Blank 


Line 


separator , 


<220> 


field 


identifier 


L 


3189 


M: 


283 


W: 


Missing 


Blank 


Line 


separator , 


<220> 


field 


identifier 


L 


3 2 2 7 


M: 


283 


W: 


Missing 


Blank 


Line 


separator , 


<220> 


field 


identifier 


L 


3373 


M: 


283 


W: 


Missing 


Blank 


Line 


separator , 


<220> 


field 


identifier 


L 


3412 


M: 


283 


W: 


Missing 


Blank 


Line 


separator , 


<220> 


field 


identifier 


L 


3565 


M: 


340 


E: 


(46) " n ' 


or "Xaa" used: Feature required, 


for SEQ ID# : 93 


L 


4104 


M: 


283 


W: 


Missing 


Blank 


Line 


separator , 


<220> 


field 


identifier 


L 


4192 


M: 


283 


W: 


Missing 


Blank 


Line 


separator , 


<220> 


field 


identifier 


L : 


4270 


M: 


283 


W: 


Missing 


Blank 


Line 


separator , 


<220> 


field 


identifier 


L: 


428 5 


M: 


258 


W: 


Mandatory Feature missing, <2 


23> not 


found 


for SEQ ID# : 113 


L : 


4285 


M: 


341 


W: 


(46) n 


or "Xaa" used, for SEQ ID# : 


113 




L : 


4 3 3 2 


M: 


283 


W: 


Missing 


Blank 


Line 


separator , 


<220> 


field 


identifier 


L : 


4 414 


M: 


283 


W: 


Missing 


Blank 


Line 


separator , 


<220> 


field 


identifier 


L : 


4529 


M: 


283 


W : 


Missing 


Blank 


Line 


separator , 


<220> 


field 


identifier 


L : 


4541 


M: 


254 


E: 


No. of Bases conflict, LENGTH 
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